JPA batch inserts: fixing slow performance with large data volumes


xz_4321

3573 views · published 2023-06-29 16:32:43


Before relying on JPA's saveAll method, it helps to understand how it works:

	@Transactional
	@Override
	public <S extends T> List<S> saveAll(Iterable<S> entities) {

		Assert.notNull(entities, "Entities must not be null!");

		List<S> result = new ArrayList<>();
		// plain for loop: entities are saved one at a time
		for (S entity : entities) {
			result.add(save(entity));
		}

		return result;
	}

	@Transactional
	@Override
	public <S extends T> S save(S entity) {

		Assert.notNull(entity, "Entity must not be null.");
		// every entity goes through this check; merge() may also issue a SELECT before writing
		if (entityInformation.isNew(entity)) {
			em.persist(entity);
			return entity;
		} else {
			return em.merge(entity);
		}
	}

	public boolean isNew(T entity) {

		ID id = getId(entity);
		Class<ID> idType = getIdType();

		if (!idType.isPrimitive()) {
			// a non-null id means the entity is treated as existing (update); a null id means insert
			return id == null;
		}

		if (id instanceof Number) {
			return ((Number) id).longValue() == 0L;
		}

		throw new IllegalArgumentException(String.format("Unsupported primitive id type %s", idType));
	}

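The code above is from the Spring Data JPA source, and it explains why writes are slow: the for loop saves rows one at a time, and each one goes through the isNew check. When the id is already set, save() calls em.merge, which typically issues a SELECT before the INSERT.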
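To make the routing concrete, here is a minimal plain-Java sketch that imitates the isNew decision shown above. It does not touch JPA at all; the class name SaveRoutingSketch and the route helper are hypothetical and exist only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

/** Plain-Java imitation of the save() routing shown above; no JPA involved. */
final class SaveRoutingSketch {

    /** Mirrors the isNew check: reference ids are new when null, primitive numeric ids when 0. */
    static boolean isNew(Object id, Class<?> idType) {
        if (!idType.isPrimitive()) {
            return id == null;
        }
        if (id instanceof Number) {
            return ((Number) id).longValue() == 0L;
        }
        throw new IllegalArgumentException("Unsupported primitive id type " + idType);
    }

    /** Returns which EntityManager call save() would pick for each id. */
    static List<String> route(List<Long> ids) {
        List<String> calls = new ArrayList<>();
        for (Long id : ids) {
            calls.add(isNew(id, Long.class) ? "persist" : "merge");
        }
        return calls;
    }

    public static void main(String[] args) {
        List<Long> ids = new ArrayList<>();
        ids.add(null); // id left for the database or a sequence to fill in
        ids.add(42L);  // id assigned by the application up front
        System.out.println(route(ids)); // prints [persist, merge]
    }
}
```

An entity whose id is assigned before saving takes the merge branch, which is the slower path for a bulk load.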

Here are two solutions I have tested myself:

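Option 1: write the insert logic yourself by injecting an EntityManager (entityManager), as in the code below.
Data is written in batches. Each batch is flushed to the database together, and the commit happens when the surrounding transaction ends.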


    @Value("${spring.jpa.properties.hibernate.jdbc.batch_size:1000}")
    private int batchSize;

    @PersistenceContext
    private EntityManager entityManager;

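    // Assumption: this lives in a Spring bean; persist() needs an active transaction.
    @Transactional
    public <T> void batchInsert(List<T> list) {
        if (!ObjectUtils.isEmpty(list)){
            for (int i = 1; i <= list.size(); i++) {
                // insert the entity
                entityManager.persist(list.get(i - 1));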
                if (i % batchSize == 0) {
                    entityManager.flush();
                    entityManager.clear();
                }
            }
            if (list.size() % batchSize != 0) {
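                // flush() synchronizes the persistence context with the database, writing out all pending entity state.
                entityManager.flush();
                // clear() empties the persistence context and detaches all managed entities; changes not yet flushed are discarded.
                entityManager.clear();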
            }
        }
    }

    @Transactional
    public <T> void batchUpdate(List<T> list) {
        if (!ObjectUtils.isEmpty(list)){
            // i runs from 1 to list.size() inclusive so that every element is merged
            for (int i = 1; i <= list.size(); i++) {
                entityManager.merge(list.get(i - 1));
                if (i % batchSize == 0) {
                    entityManager.flush();
                    entityManager.clear();
                }
            }
            if (list.size() % batchSize != 0) {
                entityManager.flush();
                entityManager.clear();
            }
        }
    }
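The flush-every-N pattern can be checked without a database. The sketch below is only an illustration under stated assumptions: FakeContext is a hypothetical stand-in for the persistence context, and it records how many entities each flush would write.

```java
import java.util.ArrayList;
import java.util.List;

/** Simulates the flush-every-batchSize loop without a database. */
final class BatchFlushSketch {

    /** Hypothetical stand-in for the persistence context: records what each flush would write. */
    static final class FakeContext<T> {
        final List<T> pending = new ArrayList<>();
        final List<Integer> flushedBatchSizes = new ArrayList<>();

        void persist(T entity) {
            pending.add(entity);
        }

        void flushAndClear() {
            flushedBatchSizes.add(pending.size());
            pending.clear();
        }
    }

    /** Returns the size of every batch that would be flushed, in order. */
    static <T> List<Integer> batchInsert(List<T> list, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        FakeContext<T> ctx = new FakeContext<>();
        int count = 0;
        for (T entity : list) {
            ctx.persist(entity);
            if (++count % batchSize == 0) {
                ctx.flushAndClear(); // a full batch is ready
            }
        }
        if (!ctx.pending.isEmpty()) {
            ctx.flushAndClear(); // trailing partial batch
        }
        return ctx.flushedBatchSizes;
    }

    public static void main(String[] args) {
        List<Integer> rows = new ArrayList<>();
        for (int i = 0; i < 2500; i++) {
            rows.add(i);
        }
        System.out.println(batchInsert(rows, 1000)); // prints [1000, 1000, 500]
    }
}
```

Clearing after each flush keeps the number of managed entities bounded by the batch size, so memory use stays flat however long the input list is.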

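Option 2: no custom logic needed; keep using JPA's saveAll() method.
Enable JPA batching (the Hibernate JDBC batch size, i.e. the spring.jpa.properties.hibernate.jdbc.batch_size property read in the code above).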
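The original settings were shown in a screenshot that did not survive in this text, so the fragment below is an assumption about what such a configuration plausibly looks like in a Spring Boot application.properties file, not a copy of the author's settings. The batch_size key is the one read by the @Value annotation above; the two ordering properties are standard Hibernate settings that are commonly enabled together with it.

```properties
# Number of statements Hibernate sends per JDBC batch (same key as in the @Value above)
spring.jpa.properties.hibernate.jdbc.batch_size=1000
# Optional: group statements by entity type so batches are not broken up
spring.jpa.properties.hibernate.order_inserts=true
spring.jpa.properties.hibernate.order_updates=true
```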

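In the entity mapped with @Table, use a sequence for the primary key. PostgreSQL supports sequences, so this works there; other databases may not support them.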

    @Id
    @GeneratedValue(strategy = GenerationType.SEQUENCE, generator = "seqGen")
    @SequenceGenerator(name = "seqGen", sequenceName = "seq", initialValue = 1)
    private Long id; // field name and type are illustrative

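With this mapping the id is still null when saveAll() is called, so isNew returns true and save() goes straight to em.persist(entity) instead of em.merge. Sequence ids also let Hibernate batch the INSERT statements, which it cannot do with IDENTITY-generated ids.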

The two approaches perform about the same; I am noting them here for reference.
